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ABSTRACT 


Multiyear estimation of at-harvest stratum crop proportions has been 
shown previously to be more efficient than single year estimation. In this 
report, the behavior of multiyear estimators :Ln the presence of missing 
acquisitions is studied. Only the (worst) case when a segment proportion 
cannot be estimated for the entire year is considered. The effect of these 
missing segments on the variance of the at-harvest stratum crop proportion 
estimator is considered when (i) missing segments are not replaced, (ii) 
missing segments are replaced by segments not sampled in previous years. 

The principle recommendations from this study are (1) to replace missing 
segments according to some specified strategy and (2) to use a sequential 
procedure tor selecting a sampling design ; i.e., choose an optimal two year 
design and then, based on the observed two year design after segment losses 
have been taken into account, choose the best possible three year design having 
the observed two year parent design. 


Missing Observations in Multiyear Rotation Sampling Designs 


1. Introduction 

Previous work (c.f., Technical Reports 18 and 19) has shown that 
multiyear estimation produces crop proportion estimates which are more 
efficient than single year estimators. Reductions in the variance of 
the estimated at -harvest acreage can exceed 30% for two and three year 
designs. These results, however, are based on the availability of esti- 
mated segment proportions for each sampled segment in each ''season". 

Since cloud cover, mechanical malfunctions, registration problems, etc. 
will in practice make some estimated proportions unavailable in some 
seasons, it is necessary to determine the effect of missing segment 
observations on the efficiency of multiyear rotation designs. 

Missing acquisitions can lead to one of the following three situations. 

(i) Individual acquisitions are missing but a sufficient number are 
available to produce the required segment proportion estimates for each 
season. The quality of the estimates may be reduced. 

(ii) The pattern of missing acquisitions precludes some but not all 
season estimates. For example, cloud cover may make an early season 
estimate impossible but mid and late season estimates can be produced. 

(iii) The pattern of missing acquisitions is such that segment pro- 
portion estimates are unavailable for the entire year. 

From the sampling design standpoint, situation (iii) is the worst 
possible case and will be the only one dealt with in this report. 


mi 
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The basic analysis of variance mixed model being considered is given 
matrix form by 

Y - X 

where Y « vector of transformed segment proportions , 

a - vector of (fixed) year effects , 

5 ■ vector of (fixed) systematic season biases » 

B * vector of (random) segment effects , 

X * fixed effects design matrix f 
U =* random effects design matrix » 
e * vector of errors. 

Since the variance of the transformed proportion is a function of the true 
segment proportion , a weighted least squares estimation procedure is used. 
Details concerning this model and the following remarks on parameter esti- 
mation can be found in Technical Reports 18 and 20. 

Primary interest centers about the estimation of and var (a^) since 
the estimated at-harvest stratum proportion for the current year T is given 

A ^ A St 

by Pj * y (a^) . The variance of a^, , the estimated effect of the current year, 
is the (T,T)^ entry in the matrix 
(X' VT V _1 WX) -1 o 2 , 

E 

where W - matrix of weights , 

e = We = vector of transformed errors. 




+ JB + Ie 


V - I + y WUUTT , 

2 . 2 

* * °b 1 °E ‘ 

The choice of a good design reduces to choosing a design matrix U which 
minimizes var(a^) . 


i 

i 

I 

i 
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For the unweighted model where W Is the Identity matrix, two and 
three rotation year designs can be characterised by a sequence of non- 
negative integers “[X^, Xj.... X^] , where 

Xj = numoer of segments used in first year only , 

X 2 =* number of segments used in first and second years only , 

X^ ■ number of segments used in second year only 

X^ - number of segments used in first and third years only , 

X 5 - number of segments used in all three years . 

Xg * number of segments used in second and third years only , 

= number of segments used in third year only. 

* * * it 

For two year designs, we can abbreviate X to X ■ [X^, X^ ]. 

Numerous examples of rotation designs and their X - sequences are given 
in Technical Report 18. 

In Sections 2 and 3 we consider the effect of missing segment obser- 
vations for the entire year. In Sections 4 and 5 we study one of many 
possible replacement strategies for the missing segments. Section 6 
contains recommendations on the choice of a design based on the work in 
the previous sections. 

2. Non-Replaced Segment Losses 

In our study of the effect of missing segment proportion estimates 
for the entire year, we shall assume that the initial (complete) design 
has the same number of segments. R. in each year. In addition, the weight 
matrix W will be the identity matrix. We will consider segment losses only 
in the last year of the design. Since probabilities are not computed in 
this part of the study, these segment losses need not be independent. We 
shall assume that in the last year, lost segments which had not been sampled 
in any previous year will be replaced by a comparable new segment while 


segments which had been sampled in previous years will not be replaced if they 

it 

are lost. Thus, for two year designs, only X^ -type segments can be missing, 
while for three year designs, X^, X^, and X^ -type segments can be missing. 

A 

If all segments in the last year are missing, then varCa^) cannot bo estimated. 
Thus, in what follows, we shall always retain at least one segment in the last 
year. 

The effect of segment loss in the current (last) year will be measured by 

A 

var (a ) 

C - C(y, R, X) - . 

2 

a 

e 

2 

The value of C is the coefficient of a in the expression for var (a^) . A 
small value of C indicates a good design. 

All two and three year designs for selected numbers of segments per year, R, 
were enumerated. For each such design, all possible combinations of missing 
segments were generated and the value of C was computed for selected variance 
component ratios y. The values of C are given in Appendices 1 and 2. The 
scope of each appendix is given in Table 1. The range of y values includes 
Table 1. Scope of Appendices 1 and 2 

Number of Years 

Appendix in the Design R y 

1 2 2, 3, 4, 5 0.5,1,2,3,4,10,50,100 

2 3 2,3,4 0.5,1,2,3,4,10,50,100 

2 

cases where the variation among segments, a, , was smaller, equal to and 

D 

2 

greater than the weighted error variance . In these appendices, the number 
of segments lost is given rather than the percentage lost. Since we are 
dealing with very few segments per year, percentages would not be comparable 
for different values of R and conclusions concerning the effect for a range 
of percentages would generally be restricted to a single value of R. 


3. Conclusions for Non-replaced Segment Losses 


From the Information contained in Appendix 1, the following inclusions 
were obtained for two year designs. 

(1) For a fixed number of lost segments , as the variance component 
ratio y increases, the effect of segment loss as measured by C becomes 

greater. Thus, as the variability among segments increases, the loss of 

A 

a segment leads to a more pronounced Increase In the estimate of var (a^) . 

(11) For fixed y , as the number of segments lost Increases , C increases. 

(Ill) For a fixed y and number of segments lost, as the amount of overlap 

(segments used in both years) in the complete design decreases, the value of 

C decreases. Thus, designs with a minimal amount of overlap would appear to 

be less affected by segment loss. However, the loss of only a portion of the 

* 

overlap segments (Xj - type segments) has a small effect relative to the large 

effect of losing all overlap segments. This holds even when the number of 

* 

remaining segments (X^ * type segments) is large. An example of this is shown 

in Table 2 for R * 3 and selected y . The table entry is the value of C. The 

designs are shown in Figures 1-3. This latter observation indicates that a 

design with a large initial overlap would be preferable since there would very 

likely be overlap segments remaining after missing segments were accounted for. 

Thus, the choice of a design in the face of segment loss must be a compromise 

between these two conflicting conclusions. This need to compromise leads us 

to conclude that good two year designs are ones which contain a modest amount 
* 

of overlap (Xj - type segments) . Such designs were also recommended in Tech- 
nical Report 18 as being near optimal in terms of efficiency. 
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Table 2. Values of C for two year designs having R * 3. 
^ Number of Y 

X Missing Segments 0.5 1.0 10.0 100.0 


030 

0 

0.389 

0.556 

3.556 

33.556 


1 

0.500 

0.675 

3.687 

33.689 


2 

0.800 

1.000 

4.048 

34.055 

121 

0 

0.366 

0.496 

2.755 

25.257 


1 

0.493 

0.646 

2.955 

25.465 


2 

1.000 

1.500 

10.500 

100.500 

212 

0 

0.362 

0.481 

2.328 

20.337 


1 

0.550 

0.800 

5.300 

50.300 


Figure 1^ 

Figure 2 


Figure 3 
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Year 

Year 

Segment 

1 2 

Segment 1 

2 

Segment 1 

2 

1 

X * 

1 X 


1 X 


2 

X * 

2 X 

* 

2 X 


3 

X X 

3 X 

* 

3 X 

* 



4 

X 

4 

X 





5 

X 

* 


* 


* 

1. 2) 

X * 

(0, 3, 0) 

X - (1, 

2, 1) 

X - (2, 


From the information contained in Appendix 2, the following conclusions 
were obtained for three year designs. 

(i) For a fixed total number of missing segments, as the variance com- 
ponent ratio y increases, the effect of segment loss Increases; i.e., C 
increases. 

(ii) For a fixed y» as the total number of segments lost increases, C 
increases. 

These first two conclusions parallel those for two year designs and deal 
only with the total number of segments lost. It is important to know what 


t An X indicates that the segment was observed In that year and a * 
Indicates overlap segments which can be missing. 
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effect, if any, the pattern of visaing segments has on the estimation of 

/\ 

var (cij) . To address this question, the following definition is needed. 

Definition: A rotation design is said to be disconnected if the set of 

segments in any year consists entirely of segments which are observed in 
that year only. 

This definition is a special case of a more general definition of 
connectedness in the experimental design literature. As examples, the designs 
in Figures 4 and 5 are disconnected while the design in Figure 6 is connect- 
ed (R - 3). 

In terms of X - sequences, a rotation design is disconnected if and only 

if A^ - R (equivalently, A^ ■ A^ ■ A^ - 0) or A^ ■ R (equivalently, 

A^ ■ A^ ■ A^ ■ 0) or A^ ■ R (equivalently, A^ ■ A,. « A^ - 0) , where R 

is the number of segments observed in the corresponding year. For designs 

with missing segments, the equivalent characterisations are more useful in 

conjunction with Appendix 2 for determining disconnectedness since the 

A - sequences for designs with missing segments are not explicitly given in 

their entirety. The necessary portions of the A - sequence can be constructed 

by observing that the loss of a ^ - type segment in the third year increases 

the number of A^ - type segments by one. A similar relationship holds between 

A, and A. - type segments and between A, and A. - type segments. For example, 
b l o j 

for the design A * (0, 2, 0, 1, 0, 1, 1) , if one A^ - type segment and one 

A, “ type segment are missing, then the resulting design has a A - sequence 
o 

A ■ (1,2, 1,0, 0,0,1) , which indicates a disconnected design. 


t 
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Figure 4 


Figure 5 

Figure 6 
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X 
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3 
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3 

X 

X 

3 
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X 

4 


X 

X 

4 


X 

4 


X 

i 

5 


X 

X 

5 


X 

5 


X 

i 

6 


X 

X 

6 


X 

6 


X 

X 

X - (3,0,0 

,0. 

0.3,0,) 

X - (0, 

0,3 

.3, 0,0,0) 

X - (1,0 

. 2 . 

2.0 

.1.0) 


(iii) For a fixed total number of missing segments, the X - category 
or categories from which the segments were lost (X^, X^* or X^) has very 
little effect on the value of C unless the lost segments produce a design which 
is disconnected. If a disconnected design results, it is common for the value 
of C to be nearly twice as large as the corresponding values of C when the same 
number of segments are removed without disconnecting the design. This result 
holds even when the number of remaining segments is large* This conclusion is 
illustrated in Table 3 for the design shown in Figure 6. The corresponding 
incomplete rotation designs are ^hown in Figures 7-10 (Note the changes in the 
X - sequences.). The lesson to be learned from Table 3 is that the loss of one 
crucial segment can produce more damage than the loss of several less crucial 
segments in the current year. 


Table 3. An illustration of the effect of disconnectedness on C. 


Figure 

Number of 
Missing Segments 

Number 

\ 

Lost 

X 5 

in 

X 6 

0.5 

Y 

1.0 

10.0 

100.0 

6 

0 

0 

0 

0 

0.306 

0.402 

1.929 

16.935 

7 

1 

0 

0 

1 

0.450 

0.625 

3.637 

33.639 

8 

1 

1 

0 

0 

0.400 

0.500 

2.024 

17.027 

9 

2 

1 

0 

1 

0.729 

0.929 

3.977 

33.SS5 

10 

2 

2 

0 

0 

0.729 

0.929 

3.977 

33.983 



Figure 7 
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Segment 

1 

2 

3 
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5 
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Year 


12 3 , 

X 

X ’ X 

X X 

X 
X 
X 


X - (1,0, 3,2, 0,0,0) 


Figure 8 
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X 

X X 

X 
X 

X X 


X - (2, 0,2,1, 0,1,0) 



Figure 9 


Figure 10 


Year 


Year 


Segment 

12 3 

Segment 

1 2 

3 

1 

X 

1 

X 


2 

X 

2 

X 


3 

X X 

3 

X 


4 

X 

4 

X 


5 

X 

5 

X 


6 

X 

6 

X 

X 

X - (2,0,3 

,1,0, 0,0) 

X « (3,0, 

2, 0,0, 1,0) 


(iv) 

For a given 

number of segments per 

year R, each 

two year parent 

produces desigrs which 

are reasonably robust 

to segment loss provided that 


the missing segments do not produce a disconnected design. 

(v) In light of the difficulties which result from a disconnected 
design, a good choice of a complete design would be one in which there 
is sufficient overlap so that the chance of having a disconnected design 
after lost segments have been accounted for is small. This recommendation 
is compatable with the choice of optimal or near optimal designs as deter- 
mined by the efficiency considerations in Technical Report 18. 



Y, 
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4. Replacement Strategies 

There are many possible replacement strategies for missing segments 

which could be implemented. Several are outlined and discussed below for 

the case when segments are missing for the entire year. 

(I) Replace any missing segment in the current year by a segment which has 

A 

not been sampled in previous years; i.e., by a ^ - type segment for two year 
designs. and a X^ - type cegment for three year designs. 

(II) If possible, replace a missing segment by a segment which would not change 

* 

the A sequence of the design. If this is impossible, then replace it by a X, 
or -type segment. For example, in the design shown in Figure 11, either 

segment 1 or segment 2 can be used in the third year as the X^ - type segment. 

Hence, if segment 2 is requested in the third year but proportion estimates 

cannot be made, then it would be replaced by segment 1. In contrast, the loss 

of segment 4 or 5 in the third year would necessitate the introduction of a 

new sixth segment into t' * design. 

Figure 11 

Year 

12 3 


Segment 

1 

2 

3 

4 

5 


X 

X 

X X 
X 
X 


X 

X 


X - (1,1, 0,1, 0,2,0) 

(III) Modify strategy II to allow for the replacement of a missing segment 
in the current year by any previously sampled segment which is not already 
allocated to the ./rent year. If sufficient segments were not available as 
replacements, then X^ - type segments would be added. For this strategy, in 
the '^slgn in Figure 11, segments 1 and 3 could be used as replacements for 
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any of segments 2, 4> or 5 which were missing in the third year. A new 
segment would need to be added only if all three third year segments were 
missing. 

The advantage of strategies such as II and III is that the modified 
designs are less likely to be disconnected than are those resulting from 
strategy I It is clear from previous work that within strategy III* the 
use of some segments as replacements would result in better designs than 
for other available segments. The determination of the actual optimal 
replacement segment would be extremely complicated and would be best 
dealt with on a case by case basis when the need arises. The difficulty here 
is that the modified design can be any of the possible designs which arise 
from the parent design. In contrast* strategy I has the advantage of being 
simple to implement. 

The preceeding remarks deal exclusively with unweighted estimation where 
all segments within a X - category are interchangeable. For weighted esti- 
mation, the loss of different segments within the same X - category will have 

A 

differing effects on the estimation of var(a^) and so cannot be treated as 
interchangeable. This further complicates strategies II and III and, in fact, 
the optimal segment replacement could only be determined by using knowledge of 
the particular segments sampled in previous years. For these reasons, we shall 
only discuss replacement strategy I. 

In what follows, we shall be dealing with unweighted estimation only and 
segment loss will occur only in the last year of the design. We shall assume 
that segment losses are independent (in contrast to Section 2) and that the 
probability of loss for each segment is the same. Under these assumptions, 
the number of segments X^, missing of type X^ has a binomial distribution. 
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For three year designs, the joint distribution of (X^, X^, X^) is the product 
of three independent binomial distributions each having the same probability of 
"success". 

Comparisons of designs under this replacement strategy will be affected 
by computing 

C * C (p, if) * I £ C , 

V V V 


where 


f “ f^(p) ~ probability that the replacement design will 
be used, 

p = probability that any one particular segment will be lost, 

C^= C^(y) = value of C for the replacement design for a 
given value of y » 

and where the sum is taken over all possible replacement designs for the 
design under study. The design which results from applying tha replacement 
strategy to the original design after the missing segments have been determined 
is referred to as the replacement design. 

Thus, we are comparing weighted averages of possible C -values. Small 

a 

values of C indicate designs for which var (a^) will, on the average, be 
relatively small. 

Values of C were computed for all two and three year designs with the 
values of R and y indicated in Table 1 and the probability p of segment 

loss being 0.1, 0.2 0.9 . Appendix 3 contains the C values for two 

year designs while the corresponding information for three year designs is 
in Appendix A. Three year disconnected designs in which ■ R and two 
year disconnected designs were not considered since for these designs C 
is independent of p. This follows from the fact that the only replacement 
design is the design itself. 
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5. Conclusions for Replacement Strategy 1. 

From the information contained in Appendix 3 the following conclusions 
were obtained concerning the behavior of two year designs under renlacement 
strategy I . 

(i) For a fixed probability of segment loss, p, as the variance com- 
ponent ratio y increases, C increases. 

(ii) For a fixed number of segments per year, R, and a fixed y, as p 
increases the number of segments sampled in both years (X^ - type segments) 
for the best design (minimum C ) increases. This is consistent with previous 
results since, for a fixed design, increasing p will increase the probability 
that the replacement design will be disconnected , thereby contributing a 
large C - value with large probability to the calculation of C . 

Hence, for large segment loss probabilities p, a good design will have a 
large number of segments sampled in both years while for small p , a good 
design will have a minimal amount of overlap. As a result, the choice of 
an optimal or near optimal design under this replacement strategy requires 
some knowledge of the probability of segment loss. Near optimality may be 
obtained without precise knowledge of p. 

(iii) As R increases, C decreases for a fixed p and y. Thus, an 
increase in the number of segments per year will, on the average, reduce 

A 

var (c^). 

(iv) For a given design and value of Y» C is a polynomial in p of 
* 

degree where the coefficients are functions of the C - values of the 
possible replacement designs. This follows from the fact that C is a 
weighted average of C values where the weights are binomial probabilities 
in terms of p. These polynomials are not in general monotonic. For a 
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given y, the value of C may be large for values of p close to 0 and 1 but 
smaller for intermediate values of p. Such an example is given in Table 4 which 
contains the values of C for the design X ■ (2, 3, 2) . For this design* C is 
a cubic polynomial in p. 


Table 4. Values of C for the design X * (2, 3, 2) . 

Y 


p 

0.5 

1.0 

10.0 

100.0 

0.1 

** 

0.218* 

0.293** 

1.544 

13.941 

0.2 

0.218* 

0.292* 

1.510 

13.521 

0.3 

0.218** 

0.293* 

1.491* 

13.243* 

0.4 

0.219 

0.294 

1.490* 

13.166* 

0.5 

0.221 

0.297 

1.512 

13.346 

0.6 

0.222 

0.302 

1.563 

13.842 

0.7 

0.22.' 

0.307 

1.645 

14.711 

0.8 

0.227 

0.314 

1.765 

16.011 

0.9 

0.230 

0.323 

1.926 

17.799 


* values of p and y for which this design is optimal 
** other designs are also optimal for this combination of p and y 


An intuitive interpretation of the behavior of C in Table 4 is difficult at 
best. However, some light can be shed on its behavior by studying the C - values 
for possible replacement designs since the coefficients in the polynomial expression 
for C are functions of these C - values. The C - values corresponding to the 
y - values in Table 4 are given in Table 5. 


Table 5. Values of C for possible replacement designs 
associated with the design X ■ (2, 3, 2) . 


X of Replacement Y 



Design 

0.5 

1.0 

10.0 

100.0 

2 

3 

2 

0.218 

0.294 

1.587 

14.445 

3 

9 

3 

0.217 

0.288 

1.433 

12.686 

4 

1 

4 

0.221 

0.296 

1.360 

11.377 

5 

0 

5 

0.233 

0.333 

2.133 

20.133 
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For a given y, C is a weighted average of these C - values. As p increases, 
the probability that the replacement design will be the disconnected design 
X * (5, 0, 5) increases. Thus C will increase. For intermediate values of p, 
more weight is placed on replacement designs with smaller C - values, thereby 
reducing C. 

The conclusion which can be drawn here reinforces that in (ii), viz., when the 
between segment variation is large (large y) , some knowledge of the probability of 
segment loss, p, is essential in determining a design which can be expected to 
perform well under this replacement strategy. 

A list of optimal or near optimal two year designs under this replacement 
strategy is given in Table 6. A striking feature of this list is that every 
design which is not disconnected appears in it for some configuration of values 
of p and y, again emphasizing the need for some knowledge of these parameters. 

An indication of the range of values of C for the optimal and near optimal 
designs in Table 6 for selected values of p and y is given in Table 7. The for- 
mat of this table has been rearranged to that the effect of changing R, the number 
of segments per year, is easily seen. It is important to realize that the design 
for each entry varies. Thus, for a given R, no single design can achieve the indi- 
cated values of C for all p and y . 

From the information in Appendix 4, the following conclusions were obtained 
concerning the behavior of three year designs under replacement strategy I. 

(i) For a fixed probability of segment loss p, the values of C increase as 
the variance component ratio y increases. 

(ii) For a fixed y, C is a polynomial in p of degree X^ + X^ + X^ where 
the coefficients are functions of the C - values of the possible replacement 
designs. As a result, the behavior of C for three year designs is similar to 
that described previously for two year designs. 
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Table 6. Optimal and near optimal two year designs. 




Range of parameters where 

\ 


* 

design is optimal 


R 

X 

P 

Y 


2 

0 2 0 

0.5 - 0.9 

all 

\ 


111 

0.1 - 0.5 

all 


3 

0 3 0 

0.6 - 0.9 

all 

- 


12 1 

0.3 - 0.5 

all 

1 



0.1 - 0.2 

0.5 



2 12 

0.1 - 0.2 

1.0 - 100.0 

| 

4 

0 4 0 

0.6 - 0.9 

all 

j 


13 1 

0.5 

all 



2 2 2 

0.2 - 0.4 

all 




0.1 

0.5 - 3.0 



3 13 

0.1 

4.0 100.0 


5 

0 5 0 

0.7 - 0.9 

all 




0.6 

0.5 - 3.0 



1 4 1 

0.5 

0.5 - 3.0 




0.6 

4.0 - 100.0 



3 2 3 

0.1 

0.5 - 10.0 




0.2 - 0.3 

all 



4 14 

0.1 

50.0 - 100.0 



k 
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Table 7. Minimum values of C for two year designs. 


p 

2 

3 

4 

5 



yjL 

0.5 

• 

0.1 

0.546 

0.366 

0.272 

0.220 

0.3 

0.554 

0.366 

0.276 


0.6 

0.564 

0.372 

0.276 

0.220 

0.9 

0.576 

0.381 

0.284 

0.230 



XJ 1 

X 0 


0.1 

0.736 

0.489 

0.364 

0.290 

0.3 

0.756 

0.741 

0.368 

0.295 

0.6 

0.780 

0.768 

0.376 

0.295 

0.9 

0.814 

1.140 

0.400 

0.315 



XX 

10.0 


0.1 

3.912 

2.451 

1.804 

1.425 

0.3 

4.228 

2.649 

1.876 

1.465 

0.6 

4.576 

2.796 

1.952 

1.530 

0.9 

5.050 

3.237 

2.348 

1.825 



XX 

100.0 


0.1 

35.416 

21.660 

15.636 

12.255 

0.3 

38.732 

23.937 

16.660 

12.805 

0.6 

42.378 

25.455 

17.880 

13.575 

0.9 

47.350 

30.120 

21.660 

16.670 
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(111) As the number of segments sampled per year Increases , C generally 
decreases for a fixed p and y. A few exceptional cases can be found. Thus It 

is advantageous to Increase the number of segments sampled per year. 

(iv) For a given two. year parent design, as, the probability of segment loss 

p increases, the minimum values of C occur for designs with a decreasing number 
of new segments in the third year (X^ - type segments); i.e., an increasing number 
of segments sampled in previous years should be resampled in the third year. For 
a fixed value of X^ within a parent, designs having more segments sampled in all 
three years (X^ - type segments) have smaller values of C. 

(v) For a fixed R, each two year parent design will produce a three year 

design with reasonable (but not necessarily near optimal) values of C. The 

best three year designs result from two year parents with a minimal amount of 
overlap (X^ - type segments), which are not near optimal two year designs. 

A list of optimal or near optimal three year designs is given in Table 8 
for replacement strategy I. The final choice of a three year design from among 
those in Table 8 must take into account the performance of the two year parent 
design since at-harvest estimates will be required during each of the three 
years that the design is in use. Thus, disconnected two year parents should not 
be used even if they yield the smallest C - value for a particular combination 
of p and y in the third year. For example, in Table 8 for R ■ 2, the design 
X “ (1, 0, 1, 1, 0, 1, 0) actually has smaller values of C for p .3 than 
does X - (1, 0, 1, 0, 1, 0, 1), but the difference is relatively small. Since 
the former design has a disconnected two year parent, the latter would be pre- 
ferred in most cases. 

In general, the value of y does not seriously affect the optimality or near 


optimality of a particular design in Table 8. 
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R 


Table 8. Optimal and near optimal three year designs. 
* 

X X Range of p 


0011100 

111 

0.4 - 0.9 

1000110 

111 

0.4 - 0.9 

1011010 

202# 

0.1 - 0.3 

1010101 

111 

0.1 - 0.3 

1011110 

212 

0.4 - 0.9 

1021101 

212 

0.1 - 0.3 

2010111 

212 

0.1 - 0.3 

2021011 

303# 

0.1 

1122011 

313 

0.3 - 0.9 

2111021 

313 

0.3 - 0.9 

2022020 

404# 

0.2 - 0.9 

3020112 

313 

0.1 - 0.2 

3031012 

404# 

0.1 


# indicates disconnected two year parent. 


An indication of the amount by which the value of C varies for these 
near optimal designs in Table 8 for selected y and p is shown in Table 9. 

As in Table 7, the format has been rearranged so that the effect of changing 
R, the number of segments sampled per year, is easily seen. It is important 
to realize that the designs which produce these values of C vary with p and y. 
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Table 9. Minimum values of C for three year designs. 

R 


2 


3 


4 


Y - 0.5 


0.1 

0.468 

0.309 

0.228 

0.3 

0.480 

0.312 

0.232 

0.6 

0.494 

0.324 

0.240 

0.9 

0.517 

0.342 

0.256 


Y - 1.0 


0.1 

0.606 

0.402 

0.296 

0.3 

0.640 

0.417 

0.304 

0.6 

0.687 

0.441 

0.328 

0.9 

0.749 

0.492 

0.368 


Y - 10.0 


0.1 

3.034 


1.832 

1.332 

0.3 

3.429 


2.025 

1.420 

0.6 

3.906 


2.247 

1.656 

0.9 . 

4.843 


3.063 

2.296 




Y - 100.0 


0.1 

27.112 


15.792 

11.172 

0.3 

31.107 


17.490 

12.024 

0.6 . . 

35.714 

j ^ 

19.770 

14.032 

0.9 

45.645 


28.488 

21.184 



6 . Recommendat Ions 


In this report we have studied, in effect, two strategies for dealing 
with segment .loss: no replacement and replacement by new segments. The 

principal recommendation is that some replacement strategy should be used. 

The non-replacement of lost segments, for whatever reason, can be disastrous 
in many cases. 

If the situation should arise where lost segments cannot be replaced, the 
following guidelines should be used in choosing a design. 

(i) A two year design having a "moderate" number of segments sampled in 
both years should be chosen. "Moderate" must be interpreted in light of the 
likelihood that a segment will be lost. As this likelihood Increases, the 
number of segments sampled in both years shculd increase. A design with 
complete rotation of segments should never be used. In extreme situations 
where large numbers of lost segments are almost certain, a no rotation design 
(the same segments sampled in both years) 'rould be recommended. This should 
not be construed as a blanket recommendation for the no rotation design since 
it performs poorly from an efficiency viewpoint. 

(ii) A three year design should be chosen which has a "large" proportion 
of the segments sampled in more than one year. The two year parent of the 
chosen design should follow recommendation (i) . As in (i) , the interpretation 
of "large" must be made in light of the likelihood of segment loss. Also as in 
(i), the no rotation design should be used only in extreme cases. It is 
generally not recommended. 
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The thrust of the above recommendations is to choose designs which avoid 
disconnectedness as Well as . possible subject to non-design considerations. 

The effect of estimating at-harvest stratum proportions from a disconnected 
design was illustrated in Table 3 but cases where the effect Is much more 
dramatic can be found in Appendices 1 and 2. 

The replacement strategy studied in Sections 4 and 3 is one of many which 
could be implemented. It was initially chosen for its simplicity. However, 
among the strategies mentioned in Section 4, it has the greatest tendency to 
produce a disconnected replacement design. In this sense, it represents a 
worst possible strategy; that is, the effect of segment loss should be more 
pronounced under strategy I than under strategies II and III. 

The key to the following recommendations for design selection under strategy 
I is a preliminary knowledge of the probability of segment loss p and the variance 
component ratio y . For hear optimality, this knowledge need not be precise; a 
reasonably narrow range of possible parameter values will suffice. 

(i) For two year designs the optimal or near optimal designs are „iven 
in Table 6. Although for some values of p and y two year no-rotation designs 
are listed, if the design is to be extended beyond two years the fact that 
these designs produce offspring which, as a group, are not near-opcimal must 
be considered. 

(il) For three year designs, the optimal or near optimal designs are 
given in Table 8. The selection of a design from this table must be tempered 
by the fact that the design must be used to produce at-harvest stratum proportion 
estimates at the end of each year. Thus, three year designs generated from a 
two year parent having no overlapping segments are not recommended when other 
designs are available. 
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It was noted In Section 5 ;hat each two year parent produces several 
reasonable but not necessarily optimal three year designs. In view of this 
fact and the somewhat contradictory results In Tables 6 and 8 t the following 
sequential procedure is recommended for choosing designs under replacement 
strategy I. 

(ill) Choose an optimal two year design based on Table 6 and knowledge 
of the values of p and y. At the end of the second year, use Appendix 4 to 
find an optimal or near optimal three year design for the two year parent 
design which has been observed. Because of segment loss, this observed parent 
may well differ from the design originally selected. 

Although the three year design resulting from recommendation (ill) probably 
will not be among the optimal designs listed in Table 8, it should perform 
reasonably well for both the second and third "ears . The recommendation 
(ill) is based on a preference for two reasonable estimates rather than a 
combination of one near optimal and one possibly poor estimate. 


Appendix 1: Values of C for Two Year 

Designs , R ® 2 i 3| 4 i 5 


The table entry Is the value of 


var(oO 

C - 


for the indicated value of y and the indicated number of missing 

* * 

- type segments from the original design with the given X -sequence 

(e.f. , Section 2). The X _ Rt >quence of the design with missing segments 

* 

can be reconstructed from the X - sequence of the original design by 

* 

subtract trig the number of missing segments from X^ and adding it to 

. * * 

Xj . For example, if the original design were X • (1* 3* 1) and 

* 

two segments were missing, the resulting design would be given by X * (3* 1» 1)* 
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Appendix 2: Values of C for Three Year 


Designs, R * 2, 3, 4. 


The table entry is the value of 


A 

var (a ) 

C = 1 

a 2 

£ 

for the indicated value of y and configuration of missing segments from 
the original design. The X - sequence of the design with missing segments 
can be reconstructed from the X - sequence of the original design. An example 
of such a reconstruction is given in Section 3. The three year designs are 
are arranged according to their two year parent design. The value of SS is 
the number of segments in the parent design. 
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Appendix 3: Values of C for Two Year 

Designs , R » 2, 3, 4, 5. 


The table entry is the value of 


C*[f C 

.. v v 




where the sum is taken over all possible replacement designs under replacement 
strategy I for the design under study (c.f., Section 4). The value of 
computed using the indicated variance component ratio y and probability of 
segment loss p. 
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Appendix 4: Values of C for Three Year 

Designs, R ■ 2, 3, 4. 


The table entry is the value of 


where the sum is taken over all possible replacement designs under 
replacement strategy I for the design under study. The value of C 
is computed using the indicated variance component ratio y and 
probability of segment loss p. The three year designs are grouped 
according to their two year parent design. The value of SS is the 
number of segments in the parent design. 
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